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Normal  Mode  Calculation 


for  Methylated  Z-DNA  poly(dG-m^dC) • (dG-m^dC) 


X.M,  Hua  and  E.W.  Prohofsky 

Physics  Department,  Purdue  University 
West  Lafayette,  IN  47907 


Abstract 

^  Normal  modes  of  methylated^ Z-DNA  poly(dG-m^dC)  •  (dG-m?<IC) 
are  computed  by  helix  lattice  dynamics.  Good  agreement  with 
Raman  spectral  data  is  obtained.  We  discuss  improvements 
in  the  formulation  of  the  problem  which  allow  us  to  greatly 
reduce  the  size  of  the  matrix  used.  This  leads  to  greatly 

reduced  calculation  times.  The  improvements  come  from  using 

s 

knowledge  of  the  and  time  reversal  symmetries.  — ^  ^  f 
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I.  Introduction 


Currently  DNA  methylation  is  an  interesting  topic  in  biopolymer 

research.  Behe  and  Felsenfeld^discovered  that  the  B-Z  transition  of 

poly (dG-m^dC) • (dG-m^dC)  occurs  at  low  salt  concentration,  close  to  usual 

2 

physiological  conditions.  Holler  et  al.  methylated  poly(dG-dC) • (dG-dC) 

at  the  N-7  position  of  guanine.  They  confirmed  that  at  100%  methylation 

of  the  guanine  residues,  poly (n/dG-dC) . (m^dG-dC)  is  fully  converted  to 

Z  form  in  a  physiological  salt  solution.  Thus  methylation  could  efficiently 

affect  genetic  activities  of  DNA.  In  particular,  the  sequence  m^dC-dG  is 

3  4 

associated  with  systems  which  have  decreased  transcription  levels,  ’ 

Crystal  structure  of  (m^dC-dG)^  Z-DNA^  and  (GCm^  CCGGCC)  A^DNA^  have 

been  solved  by  x-ray  diffraction.  Slight  structural  differences  between  - 

methylated  and  unmethylated  DNA  have  J>een  observed'’  ^  and  have  been  used 

to  explain  the  electrostatic  stability  of  methylated  Z-DNA.^’^  Structural 

information  of  the  poly (dG-m^dC)  molecule  in  solution  have  been  obtained 

8 

in  both  B-  and  Z-conformations  from  electric  dichroism  measurements. 

Raman  spectroscopy  has  been  employed  to  investigate  the  temperature  dependence 

5  9 

of  the  B-Z  transition  in  poly(dG-m  dC) .  Vaccuum  UV  CD  spectrum  were  also 
obtained  experimentally.  The  results  are  consistent  with  the  suggestion 
that  the  low-  and  high-salt  Z-forms  are  comprised  of  different  proportions 
of  Zj—  and  Z^, ^-conformations. 

Lattice  dynamics  has  been  used  to  quantitatively  explore  the  far- 

infrared  absorption  spectrum  and  Raman  scattering  spectrum  of  A-,  B-  and 
LI 

Z-form  DNA.  In  this  calculation  we  use  lattice  dynamics  to  compute  the 


2 


normal  modes  of  Methylated  Z-DNA  poly(dG-m^dC) • (dG-m^dC) .  The  internal 

force  constant  formalism  is  rearranged  and  Coulomb  potential  energy  is 

reformulated  in  a  precise  fashion.  The  C2  symmetry  of  Z-form 

poly(dG-m^dC) * (dG-m^dC)  is  emphasized.  A  brief  discussion  about  C ^ 

symmetry  is  given.  This  rearrangement  helps  us  to  greatly  reduce  the 

computer  run  time.  The  results  of  a  normal  mode  analysis  and  assignments 

9 

are  compared  with  experimental  data.  A  reasonably  good  agreement  between 
them  is  obtained. 


II.  Theoretical  Formalism 

Let  us  consider  an  infinite  double-helical  molecule.  The  helical 
symmetry  axis  is  taken  as  Z  axis  in  cartesian  coordinates.  Helical 
symmetry  implies  that  each  unit  cell  is  transformed  geometrically  into 

P;i 

the  next  by  the  operation  H(p ,/]  where  p  is  the  pitch  along  Z-axis  and 
f'.i  ‘  P?. 

jif  is  the  pitch  angle  about  Z-axis. ^4J(p  ,/)  is  thus  a  translation  through  p 

plus  rotation  through  ip . 

In  helix  lattice  dynamics  all  interactions  between  atoms  are 
characterized  by  effective  force  constants  which  are  appropriate  second 
derivatives  of  the  true  potential  with  respect  to  pairs  of  degrees  of 
freedom  of  the  system.  These  are  most  concisely  expressed  as  functions  of 

V, 

the  difference  in  positions  of  pairs  of  atoms  and  we  call  this  difference 
of  positions  the  internal  coordinate  system.  These  internal  coordinate  force 
constants  can  be  further  broken  down  into  valence  force  constants,  hydrogen 

bond  force  constants,  and  non-bonded  force  constants  which  are  due  to 

L 

Van  der  Wall  and  electrostatic  interactions.  The  electrostatic  forces  dominate 
our  non-bonded  interactions.^  The  potential  energy  in  internal  coordinates  is 
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(2.1) 


where  St  is  the  internal  coordinate,  m  labels  what  we  will  call  the 
unit  set  of  internal  coordinates  and  t  labels  the  individual  internal 
coordinate  in  the  unit  set.  Since  atoms  in  one  unit  cell  of  the  repeating 
helix  Interact  directly  with  atoms  in  other  unit  cells  the  unit  set  of 
internal  coordinates  involves  atoms  in  more  than  one  unit  cell  and  the 
unit  set  is  distinct  from  the  coordinate  of  a  unit  cell.  In  particular 
one  needs  to  pay  attention  to  the  force  constants  of  atoms  that  cross 
the  boundaries  of  the  unit  cells. 

The  internal  coordinates  are  then  expanded  in  mass  weighted  Cartesian  (MWC) 
coordinates  as 


s".i 

C  Ki 


1  *mk.^k 

=  ®ti  qi 


(2.2) 


where  u*  ,  m^  is  the  mass  of  i1*1  atom  and  u^  is  the  displacement  of 

l1"*1  atom  in  unit  cell.  *s  is  a  vector  whose  form  depends  on  the 
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character  of  the  internal  coordinate  (stretch. . .etc. ) .  The  double  super 
and  subscripts  of  emphasize  that  *5  is  assigned  to  i1*1  atom  in 
unit  cell  and  t1*1  Internal  coordinate  in  m1"*1  unit  set. 


It  has  been  proved  that 


RmgO  k-m 


where  R  is  the  rotation  through  helical  angle  tfi, 


(2.3) 


cosijj  -siniji 

R  -  sln*k  cos <1/ 


(2.4) 
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Using  the  Born- Von  Karman  boundary  condition  one  could  write 


71 

=  — -  d8Rk  q  (6)  e 

1  JfZ  1 


(2.5) 


q^-6)  =  q*(6) 


Substituting  (2.3)  and  (2.5)  into  (2.1)  and  noting  that 
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one  can  formulate  as 


(2.7) 
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(2.8) 

With  the  condition  k,H  =  0,  +1  and  k-£  *  0,  +  1  which  results  from  the 
local  properties  of  internal  forces. 

In  previous  work  the  electrostatic  forces  were  treated  as  simple 
stretch  forces.  It  is  possible  to  treat  them  more  exactly  without  this 
implicit  simplification  as  follows.  In  the  harmonic  approximation  the 
Coulomb  potential  energy  could  be  written  as 


i)  r 

C  kl  ij  3  \*t  -  Tj 


(u[-u*)}  (2.9) 


where  3  -  u^  Is  the  position  of  i ^  atom  in  unit  cell  provided 

— ►k 

is  the  equilibrium  position  and  u^  is  the  displacement,  e^  is  atomic  partial 
charge  assigned  to  atom  i  and  is  local  dielectric  function  in  the  neighbor¬ 
hood  of  atom  i. 


Eq.(2.8)  could  be  expressed  in  MWC  coordinate  as 
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where 
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Using  (2.3)— (2.6)  leads  to 
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The  kinetic  energy  T  is 
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The  Lagrangian 


leads  to  the  secular  equation: 
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where  the  harmonic  behavior  e  is  assumed  and  the  dynamical  matrix  D 


is  given  by 


r  R-k  _i_  f  $>*■  RV(k'*)0 

J,  O* 

+  E  ei*h 

“  i*Ki3  11 

+  t _ !i!i_  -r  +  3  W  e-i« 
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(2.15) 


Since  potential  energy  is  a  real  quantity,  D  should  be  hermitian: 


Du  '  V 


(2.16) 


III.  Solution  of  Secular  Equation 

We  are  dealing  with  a  double  helical  molecule  with  C 2  symmetry.  Once 
we  pick  the  0tkl  unit  cell  by  convention  the  x-axis  is  chosen  as  the 
symmetry  axis.  Even  for  those  double  helical  molecules  without  symmetry 
the  backbone  atoms  still  have  symmetry  and  the  x-axis  is  the  same. 

In  such  a  cartesian  coordinate  system  the  C ^  symmetry  implies  that  if  the 
equilibrium  position  of  i^  atom  on  one  strand  in  N*”*1  unit  cell  is  (x,y,z), 
then  the  corresponding  atom  on  the  antiparallel  strand  in  -N^  unit  cell 
is  at  (x,-y,-z). 

For  the  molecule  (dG-ra^dC) • (dG-m^dC)  the  number  of  atoms  on  one  strand 
Jn  a  unit  ceil  is  42  (without  counting  the  Hydrogen  atoms).  The  dynamical 
matrix  in  (2.14)  is  a  252x252  matrix.  In  the  following  we  show  how 
to  use  C2  symmetry  to  simplify  the  calculation.  Only  the  main  procedures  will 
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be  mentioned  here.  The  mathematical  circumstances  are  referred  to  In 
14 

a  later  manuscript. 

It  is  convenient  for  Cg  symmetry  discussion  to  rewrite  the 
secular  equation  as  the  following: 


2  S^-qjO)  +  E  D^’q^0)  = 


E  °kj'Ve)  +  ^  “  w\(6)  » 

k  I 


(3.1a) 


(3.1b) 


where  i,j  =  1,  n;  k  =  i+n,  l  =  j+n  and  n  is  the  number  of  atoms  on  one 
strand  in  a  unit  cell.  For  methylated  Z-DNA  (dG-m^dC) • (dG-m^dC)  n  =  42, 
for  unmethylated  Z-DNA  (dG-dC) • (dG-dC)  n  =  41. 

The  invariance  of  the  potential  energy  under  a  operation  implies 
that  the  dynamical  matrix  should  have  the  following  property 


if  D, 


if  D. 


x  x  x 


=  X  x  x 


X  X  X) 


X  X  X 


=  I  X  X  X 


X  X  X 


*  *  * 
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*  *  * 

-xxx  (3.2a) 
*  *  * 

-X  X  X 


*  *  * 

-x  x  x  (3.2b) 

*  *  * 

-xxx 


At  Brillouin  zone  center  (6  =  0)  both  D  and  q  are  real.  Two  independent 
types  of  eigenvector  exist,  they  are  either  symmetric  or  antisymmetric  under 
the  C2  operation.  The  symmetric  eigenvector  has  the  same  structure 


Pi 


m 


$ 


as  the  system: 


lf  qi  =  (qix*  qiy*  qi2h  then  qk  =  (qix’  -qiy’  -qiZ>-  <3'3 


iy’ 


And  the  antisymmetric  eigenvector  has  the  opposite  structure: 


lf  qi  =  (qix*  qiy*  qiZ}*  then  qk  =  (_qix’  qiy ’  qiZ)* 


iy’ 


(3.31 


That  (3.3a)  and  (3.3b)  are  solutions  of  (3.1)  can  be  seen  by  inserting 
(3.3)  into  (3.1),  and  confirming  that  (3.1a)  and  (3.1b)  are  the  same  equations 
Thus  at  0  =  0  the  C2  symmetry  allows  us  to  break  a  (252x252)  matrix 
into  two  (126x126)  matrices,  one  for  the  symmetric  eigenvectors 


2+ 


<*>  q. 


(3.4; 


And  one  for  antisymmetric  eigenvectors 


The  symbol  +  is  defined  as  follows 
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(3.6) 


and  +  =  -  +  . 

One  could  show  that  a^  and  are  still  symmetric  matrices. 

At  9  Is  0  we  are  no  longer  able  to  separate  the  eigenvectors  into 

symmetric  or  antisymmetric  types,  for  if  q  is  an  eigenvector  then,  after 

-+■  1* 

multiplying  by  a  common  arbitrary  phase  factor  qe  is  still  an  eigenvector 
corresponding  to  same  eigenvalue.  Optionally  one  can  show  that 


if  qi  =  (qix*  qiy’  qlz>’  then  qk  "  (qtx*  -qiy  ’  -qiz); 


(3.7a) 


or 


lf  qi  '  (V'  V  “l,1'  then  \  '  (-’L'  qiy’  O 

(3.7b) 

And  it  is  not  difficult  to  prove  that  (3.7a)  or  (3.7b)  is  the  solution  of 
(3.1),  but  (3.7a)  and  (3.7b)  are  no  longer  independent.  We  can  also  choose 
either  (3.7a)  or  (3.7b)  as  the  optional  solution  of  (3.1).  C2  symmetry 
allow  us  to  transfer  a  hermitian  matrix  D  to  a  real,  symmetric  matrix  G 


and  (3.1)  becomes 


where 


- 

+ 

V 

(3.9a) 
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Gkj 

■  '"'Ki 
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(3.9c) 

Uk2 

■  ReIV 
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(3.9d) 

5;- 

Re  (q  i ) 

(3.9e) 

& 

II 

im(qk) 

(3.9f) 

here  the  optional  solution  (3.7a)  has  been  used. 

The  result  of  the  reduction  in  size  of  the  matrix  allows  a  reduction 


in  computing  time  by  a  factor  of  ten  in  normal  mode  calculations. 
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IV.  Results  and  Discussion 

We  have  taken  atomic  coordinates  for  our  model  of  an  infinite 

helix  from  the  x-ray  crystal  structure  analyses  for  the  system 

(m^dC-dG)^  We  have  ignored  the  slight  difference  in  pitch  angles 

so  as  to  impose  helical  symmetry  on  the  system.  The  force  constants 
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used  have  been  those  refined  in  our  previous  models  of  DNA,  with 
the  addition  of  force  constants  for  the  methyl  group  which  were  not 
present  in  the  earlier  models.  The  values  for  the  force  constants  used 
for  the  methyl  group  are  listed  in  Table  I.  These  are  the  same  as  that 
used  for  the  cytosine  group  in  our  earlier  models.  After  an  initial 

calculation  of  the  resulting  frequencies  we  did  a  minimal  refinement  of 
our  force  constants  so  as  to  get  a  better  fit  to  the  experimentally  observed 

Q 

Raman  lines.  The  refinement  required  only  a  one  percent  change  in  force 
constant  values.  The  calculated  frequencies  displayed  are  the  zero  center 
(')  =  0)  modes. 

To  compare  the  theoretical  modes  with  those  observed  and  assigned 
experimentally  we  calculated  the  kinetic  energy  distribution  (KED)  of  our  modes. 
It  Is  traditional  to  use  potential  energy  distributions  (PED)  for  assignment. 

It  is;  far  easier  to  calculate  KED's  and  they  are  useful  in  assigning  modes. 

The  kinetic  energy  is  associated  with  atom  motion,  not  with  energy  stored  in 
a  bond  between  atoms.  The  total  kinetic  energy  of  the  system  is 


w-  V\  <r. 


at  0  =  0,  the  kinetic  energy  of  a  group  of  atoms,  say  the  guanine  base 
is  the  kinetic  energy  per  atom  summed  over  the  atoms  in  the  guanine 


group  i.e. 


^  1  2  _  -+t  -*■ 

tG  -  "  2  “  1  Vqi 

G 


Thus  the  percentage  of  kinetic  energy  stored  in  guanine  atoms  is  Tq/T. 

If  this  percentage  is  high,  the  normal  mode  is  assigned  to  guanine. 

The  results  of  this  calculation  and  its  comparison  with  experimental 
9 

data  is  displayed  in  Table  II,  where  backbone  1  groups  those  backbone 
atoms  that  are  covalently  connected  to  guanine,  and  backbone  2,  to  m^C 

The  overall  agreement  with  experiment  is  fairly  good  and  can  of  course 
be  made  much  better  by  further  refinement.  The  point  to  be  made  here  is 
that  good  agreement  can  be  found  for  problems  in  which  new  entities  are 
added  to  the  basic  helix  without  starting  from  scratch  and  doing  massive 
refinement  of  force  constants  to  achieve  a  good  fit.  The  helix  lattice 
method  is  predictive  of  the  frequencies  of  modified  DNA  on  the  basis  of  the 
theoretical  calculations. 
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